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estlmat^  In  the  Two-Parameter  Beta  Method  • the  method  of  moments  is  ado- 1 
pted  to  approximate  the  probability  density  function  of  the  maximum  likelihood  I 
estimate,  using  polynomials  of  degree  3 and  A . The  first  two  conditional 
moments  of  the  latent  trait,  given  the  maximum  likelihood  estimate,  are  derived 
from  theory  and  computed  for  the  data  for  each  value  of  the  maximum  likelihood 
estimate.  The  conditional  distribution  of  the  latent  trait,  given  the  maximum 
likelihood  estimate,  is  approximated  by  a Beta  distribution  using  the  method  of 
moments,  with  two  a priori  set  parameters  and  two  estimated  parameters  from  the  ' 

conditional  moments.  Five  scores  of  the  latent  trait  are  calibrated  according 
to  this  approximated  Beta  distribution.  For  each  of  the  ten  binary  Items,  whos4 
Item  characteristic  functions  are  to  be  estimated,  the  set  of  frequency  ratios  • 

of  these  scores  for  the  group  of  subjects  who  have  answered  correctly  to  the  to- 
tal subjects  for  each  of  the  20  intervals  of  the  latent  trait  is  taken  as  the 
estimated  Item  characteristic  function. 

In  the  present  study,  the  Two-Parameter  Beta  Method  is  further  developed  tc 
add  two  more  variations,  and  to  compare  the  result  with  the  one  obtained  in  the 
previous  study.  In  the  first  method,  the  resulting  set  of  calibrated  latent 
trait  scores  for  each  of  the  two  groups,  l.e.,  the  group  of  subjects  who  have 
answered  correctly  to  the  binary  Item  and  those  who  answered  incorrectly.  Is  a- 
pproximated  by  a polynomial  of  degree  3 or  A by  the  method  of  moments.  The 
ratio  of  the  polynomial  thus  obtained  for  the  correct  group  to  the  sum  of  these  * 

two  polynomials  Is  the  estimated  Item  characteristic  function  of  this  binary  1- 
tem.  In  the  second  method.  Instead  of  calibrating  five  latent  trait  scores  for 
each  maximum  likelihood  estimate,  the  whole  conditional  probability  density  fun-; 
ctlon  of  the  latent  trait,  given  the  maximum  likelihood  estimate,  is  used. 

These  density  functions  are  summed  up  for  each  of  the  correct  and  the  total  ' 

groups,  and  the  ratio  of  the  summed  density  functions  for  the  correct  group  to 
those  for  the  total  group  Is  the  estimated  Item  characteristic  function  of  the 
binary  Item.  This  Is  done  for  the  case  In  which  a polynomial  of  degree  3 Is  . 

used  to  approximate  the  set  of  maximum  likelihood  estimates  of  the  500  sub- 
jects, and  also  for  the  case  In  which  a polynomial  of  degree  A Is  used,  and 
they  are  called  Degree  3 Case  and  Degree  A Case  respectively. 

For  the  purpose  of  evaluating  these  results,  the  theoretical  bivariate  den- 
sity function  of  the  latent  trait  and  Its  maximum  likelihood  estimate  is  derlvec 
from  the  uniform  distribution  of  the  latent  trait  and  the  conditional  distribu- 
tion of  the  maximum  likelihood  estimate,  which  Is  approximated  by  the  normal 
distribution  with  the  latent  trait,  6 , and  the  Inverse  of  the  test  informatloi 
function,  1(6)  , as  the  two  parameters.  As  In  the  previous  study,  the  test 

Information  function  assumes  the  constant  value,  21.63  , for  the  Interval  ( 

-2.5  , 2.5)  of  6 . From  this  bivariate  density  function,  the  conditional  densi- 
ty function  of  the  latent  trait,  given  the  maximum  likelihood  estimate.  Is  ob- 
tained for  each  of  the  500  subjects.  These  conditional  density  functions  are 
summed  up  for  the  correct  group  and  the  total  group  for  each  of  the  ten  binary 
Items,  and  the  ratio  of  the  former  to  the  latter  Is  used  as  the  criterion  oper- 
ating characteristic,  in  addition  to  the  frequency  ratio  of  the  correct  group 
to  the  total  group  for  each  of  the  20  Intervals  of  6 with  the  width  of  0.25. 

It  turned  out  that  1)  the  result  of  the  second  method  Is  almost  Identical 
with  the  criterion  operating  characteristic  In  both  Degree  3 Case  and  Degree 
A Case,  and  far  better  than  the  frequency  ratios;  2)  the  result  of  the  first 
method  proves  that  the  approximation  of  the  sets  of  calibrated  latent  trait 
scores,  § * hy  the  polynomial's  of  degree  3 or  A Improves  the  closeness  of 

the  curve  to  the  theoretical  Item  characteristic  function,  to  some  extent,  but 
not  too  much. 

The  least  square  principle  is  used  In  estimating  two  parameters  following 
the  normal  ogive  model,  for  each  result. 
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ESTIMATION  OF  THE  OPERATING  CHARACTERISTICS  OF  ITEM  RESPONSE 
CATEGORlhS  II:  FURTHER  DEVELOPMENT  OF  THE  TWO-PARAMETER  BETA 

METHOD 


ABSTRACT 


The  Two-Parameter  Beta  Method^ Was  Introduced  In  the 
previous  study  as  a method  of  estimating  the  operating  charac- 
teristics of  a test  Item,  has  proved  to  be  as  efficient 

as  the  Normal  Approximation  Method,  for  a set  of  simulated  data 
of  500  hypothetical  examinees  having  a uniform  latent  trait 
distribution  between  -2.475  and  2.475  . Both  methods  are 
characterized  •^y/(l)^ithe  use  of  a relatively  small  number  of 
sub J acts— like  500"ln  the  whole  procedure  of  estimation^  (2)  without 
assuming  any  prior  mathematical  model,'  and  ;^3)^the  use' of  the 
estimated  joint  distribution  of  the  latent  trait  and  Its  maximum 
likelihood  estimate.  In  the  Two-Parameter  Beta  Method,  the 
method  of  moments  Is  adopted  to  approximate  the  probability 
density  function  of  the  maximum  likelihood  estimate,  using 
polynomials  of  degree  3 and  4 . The  first  two  conditional 
moments  of  the  latent  trait,  given  the  maximum  likelihood  esti- 
mate, are  derived  from  theory  and  computed  for  the  data  for  each 
value  of  the  maximum  likelihood  estimate.  The  conditional 
distribution  of  the  latent  trait,  given  the  maximum  likelihood 
estimate.  Is  approximated  by  a Beta  distribution  using  the  method 
of  moments,  with  two  a priori  set  parameters  and  two  estimated 
parameters  from  the  conditional  moments.^  Five  scores  of  the 
latent  trait  are  calibrated  according  to  this  approximated  Beta 
distribution.  For  each  of  the  ten  binary  Items,  whose  Item 
characteristic  functions  are  to  be  estimated,  the  set  of  frequency 
ratios  of  these  scores  for  the  group  of  subjects  who  have  answered 
correctly  to  the  total  dumber  of  subjects  for  each  of  the  20  inter- 
vals of  the  latent  trait  is  taken  as  the  estimated  item  character- 
istic function. 
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In  the  present  study,  the  Two-Parameter  Beta  Method  is 
further  developed  to  add  two  more  variations,  and  to  compare  the 
result  with  the  one  obtained  In  the  previous  study.  In  the 
first  method,  the  resulting  set  of  calibrated  latent  trait  scores 
for  each  of  the  two  groups,  l.e.,  the  group  of  subjects  who  have 
answered  correctly  to  the  binary  Item  and  those  who  answered 
Incorrectly,  Is  approximated  by  a polynomial  of  degree  3 or  4 
by  the  method  of  moments.  The  ratio  of  the  polynomial  thus 
obtained  for  the  correct  group  to  the  sum  of  these  two  polynomials 
Is  the  estimated  Item  characteristic  function  of  this  binary  Item. 

In  the  second  method.  Instead  of  calibrating  five  latent  trait 
scores  for  each  maximum  likelihood  estimate,  the  whole  condi- 
tional probability  density  function  of  the  latent  trait,  given 
the  maximum  likelihood  estimate.  Is  used.  These  density 
functions  are  summed  up  for  each  of  the  correct  and  the  total 
groups,  and  the  ratio  of  the  summed  density  functions  for  the 
correct  group  to  those  for  the  total  group  is  the  estimated  Item 
characteristic  function  of  the  binary  Item.  This  Is  done  for 
the  case  In  which  a polynomial  of  degree  3 Is  used  to  approximate 
the  set  of  maximum  likelihood  estimates  of  the  300  subjects,  and 
also  for  the  case  In  which  a polynomial  of  degree  4 Is  used,  and 
they  are  called  Degree  3 Case  and  Degree  4 Case  respectively. 

For  the  purpose  of  evaluating  these  results,  the  theoretical 
bivariate  density  function  of  the  latent  trait  and  Its  maximum 
likelihood  estimate  Is  derived  from  the  uniform  distribution  of 
the  latent  trait  and  the  conditional  distribution  of  the  maximum 
likelihood  estimate,  which  Is  approximated  by  the  normal  distri- 
bution with  the  latent  trait,  6 , and  the  inverse  of  the  test 
Information  function,  1(6)  , as  the  two  parameters.  As  In 
the  previous  study,  rhe  test  information  function  assumes  the 
constant  value,  21.63,  for  the  Interval  [-2.5,  2.5]  of  6 . 

From  this  bivariate  density  function,  the  conditional  density 
function  of  the  latent  trait,  given  the  maximum  likelihood  estimate, 
la  obtained  for  each  of  the  500  subjects.  These  conditional 
density  functions  are  suaned  up  for  the  correct  group  and  the 


total  group  for  each  of  the  ten  binary  Items,  and  the  ratio  of 
the  former  to  the  latter  Is  used  as  the  criterion  operating 


> characteristic.  In  addition  to  the  frequency  ratio  of  the 

correct  group  to  the  total  group  for  each  of  the  20  Intervals 
, of  6 with  the  width  of  0.25  . 

It  turned  out  that  1)  the  result  of  the  second  method 
Is  almost  Identical  with  the  criterion  operating  characteristic 
In  both  Degree  3 Case  and  Degree  4 Case,  and  far  better  than 
the  frequency  ratios;  2)  the  result  of  the  first  method  proves 
that  the  approximation  of  the  sets  of  calibrated  latent  trait 
scores,  0 , by  the  polynomials  of  degree  3 or  4 Improves  the 
closeness  of  the  curve  to  the  theoretical  Item  characteristic 
function,  to  some  extent,  but  not  too  much. 

The  least  square  orlnclple  Is  used  in  estimating  two 
parameters  following  the  normal  ogive  model,  for  each  result. 


The  research  was  conducted  at  the  principal  Investigator's  labo-  j 

ratory,  409  Austin  Peay  Hall,  Department  of  Psychology,  Univer- 
sity of  Tennessee,  Knoxville,  Tennessee.  The  computer  program- 
ming work  was  greatly  assisted  by  one  of  her  graduate  assistants, 

Yeh  Chlng-Chuan.  The  other  assistants  working  for  her  at  various 

times  Include  Merle  C.  Steelman,  Robert  L.  Trestman,  C.l.  Bonnie  Chen,  j 

Ll-Jen  Jenny  Chen,  and  an  undergraduate  student,  Philip  S.  Livingston.  | 

I 


i 


TABLE  OF  CONTENTS 

I Introduction 

II  Curve  Fitting  Method  for  6 
III  Results 

IV  Conditional  P.D.F.  Method 
V Criterion  Item  Characteristic  Function 
VI  Results 

VII  Discussion  and  Conclusion 


References 

90 

Appendices 

^pendlx  I 

91 

Appendix  II 

116 

Page 

1 

4 

9 

63 

65 

67 

88 


I Introduction 


The  Two-Parameter  Beta  Method  has  been  Introduced  as  a method 
of  estimating  the  operating  characteristics  of  Item  response  categories 
(Samejlma,  1977d).  Just  like  the  Normal  Approv'linatlon  Method.  Its  main 
features  will  be  summarized  as  follows. 

(1)  A relatively  small  number  of  subjects,  e.g.,  500,  are  needed  In 
the  whole  process  of  estimation. 

(2)  No  prior  mathematical  models  are  assumed  for  the  operating 
characteristics . 

Throughout  the  process,  the  method  of  moments  (Elderton  and  Johnson, 
1969,  Johnson  and  Kotz,  1970)  Is  applied  effectively,  and  the  test 
information  function  and  Its  asymptotic  properties  In  connection 
with  the  maximum  likelihood  estimate  (Samejlma,  1977a,  1977b,  1977c) 
are  fully  utilized.  On  the  other  hand,  there  are  some  restrictions  In 
this  method,  l.e.,  1)  a set  of  Items,  whose  operating  characteristics 
are  known.  Is  needed.  In  addition  to  a new  set  of  Items  whose  operating 
characteristics  are  to  be  estimated;  and  2)  the  old  set  of  Items  should 
provide  us  with  a constant  test  Information  function  throughout  the 
range  of  ability,  or  latent  trait,  of  our  Interest.  This  second  re- 
striction is  of  no  serious  obstacle  If  we  use  tailored  testing  data 
with  an  a priori  set  test  information  function  as  the  criterion  for 
terminating  the  presentation  of  new  Items  (Samejlma,  1977a,  In  press  a. 
In  press  b),  although  It  Is  more  difficult  to  realize  In  a paper-and- 
pencll  testing  situation.  A modification  of  this  procedure  to  eliminate 


the  first  restriction  Is  under  consideration.  In  order  to  cover  the 
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situation  where  no  such  old  set  of  Items  Is  available.  In  this  paper, 
however,  the  second  restriction  still  holds. 

The  main  objective  of  this  study  Is  to  prodvice  some  variations 
of  the  Two-Parameter  Beta  Method  without  changing  the  empirical  setting. 
For  this  purpose,  the  same  simulated  data  as  were  used  In  the  previous 
studies  (Samejlma,  1977b,  1977d)  are  used  throughout  the  study,  with 
500  hypothetical  examinees  whose  ability  distributes  uniformly  for  the 
range  of  ability  1-2.5,  2.5],  Thus  the  constant  value  of  the  test  In- 
formation function  of  the  old  set  of  Items  is  21.63,  which  provides 
0.215  for  the  standard  deviation  of  the  conditional  distribution  of 
the  maximum  likelihood  estimate,  given  ability  6 , and  Is  large  enough 

to  approximate  the  conditional  distribution  of  the  maximum  likelihood 
--  2 

estimate  9 by  N(e,  0.215  ).  The  new  Items,  whose  operating  charac- 
teristics are  to  be  found,  are  the  same  ten  binary  Items  which  follow 
the  normal  ogive  model. 

The  present  research  Is  divided  Into  two  parts,  l.e.,  the  esti- 
mation of  the  operating  characteristics  1)  by  fitting  a polynomial  to 
each  set  of  6 , which  are  calibrated  by  the  Monte  Carlo  method  (Same- 

jlma, 197 7d)  and  are  classified  In  terms  of  the  Item  score  of  each  of 
the  ten  binary  Items,  by  the  method  of  moments,  and  taking  Its  ratio  to 
the  sum  of  all  the  polynomials;  and  2)  by  totalling  the  whole  conditional 
probability  densities  of  6 , given  Its  maximum  likelihood  estimate, 

each  of  which  Is  approximated  by  a Beta  density  function  with  two  a 
priori  set  parameters  and  two  other  estimated  parameters  from  the  first 
two  conditional  moments  of  6 , In  terms  of  each  Item  score  category 

of  a new  Item,  and  taking  Its  ratio  to  the  total  sum  of  the  conditional 


density  functions.  For  convenience,  hereafter  the  former  method  will 
be  called  the  Curve  Fitting  Method  ^ 9 in  contrast  to  the  Histogram 
Method  for  9 which  indicates  the  method  introduced  in  the  pre- 
ceding research  report  (Samejlma,  1977d),  whereas  the  latter  method  will 
be  called  the  Conditional  Method,  within  the  context  of  the  TVo— 


Parameter  Beta  Method. 
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II  Curve  Fitting  Method  for  § 

The  simulated  data  used  for  this  part  of  the  research  are  exactly 
the  same  as  those  adopted  In  the  previous  study  of  the  Two-Parameter 
Beta  Method,  up  to  the  calibration  of  2500  values  of  6 . To  summarize: 

(1)  The  set  of  500  maximum  likelihood  estimates,  § , were  gra- 

duated by  a polynomial  by  the  method  of  moments.  In  so  doing, 
both  polynomials  of  degrees  3 and  4 were  used.  The  resulting  poly- 
nomials are: 

(2.1)  0.22416  - 0.003519  - 0.018730^  + 0.000959^ 
and 

(2.2)  0.19620  + 0.002380  + 0.013190^  - 0.000620^  - 0.004270^. 

For  convenience,  whenever  the  first  polynomial  Is  used  for  the 
probability  density  function  of  0 , we  call  It  Degree  3 Case, 

and  It  Is  called  Degree  4 Case  If  the  second  polynomial  Is  used. 

(2)  The  conditional  mean  and  variance  of  0 , given  § , 

are  given  by 

(2.3)  E(9|9)  - 0 + a2  log  g(g) 
and 

(2.4)  Var.  (0|0)  - o'*  log  g(0)  + o2  , 

where  a ■ 0.215  and  g(0)  Is  the  probability  density  function 
of  § approximated  by  one  of  the  polynomials  given  In  (1). 

(3)  The  conditional  distribution  of  e , given  the  maximum  llkell- 

A 

hood  estimate  0 , la  assumed  to  be  a Beta  distribution,  whose 


probability  density  function  Is  given  by 
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(2.5)  ♦(9|9)  - [B(pg.  qg)]"^(9  - a|)‘’S''-(b|  - 

(bj  - aj)-<'>9'^r»  , 

where  ag  and  bg  are  6 - 2.55a^  - 0 - 0.54825  and 

§ + 2.55o^  “ 0 + 0,54825  respectively,  pg  and  qg  are  obtained 
from  the  conditional  moments  by 

(2.6)  p-  - M^(l  - 

and 

(2.7)  qg  - - M^)^m'^  - (1  - Mj^)  , 

where 

(2.8)  - [E(0|0)  - ag]  (bg  - ag)"^ 

and 

(2.9)  - E[{e  - E(0|0)}^  l0]  (bg  - ag)”^  , 

respectively,  and  B(pg,  qg)  is  the  Beta  function  with  pg  and 
q*  as  its  two  parameters. 

(4)  Following  this  approximated  conditional  distribution  of  0 , 

given  0 , five  scores  were  calibrated  for  each  maximum  likeli- 

hood estimate  by  the  Monte  Carlo  method,  and  these  scores  are 
S3rmbolized  by  0 . This  procedure  provided  us  with  2500  0 's 
in  total  in  Degree  3 Case.  In  Degree  4 Case,  however,  this 
number  is  only  2465,  since  we  had  to  discard  seven  hypothetical 
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examlnees,  or  seven  extreme  values  of  the  maximum  likelihood 
estimate,  either  because  of  their  negative  values  of  the  estimated 
g(e)  or  of  their  negative  values  of  the  estimated  conditional 
variance. 

After  this  step,  Instead  of  taking  the  ratio  of  the  frequency 
distribution  of  6 for  the  hypothetical  subjects,  who  gave  the 
correct  answer  to  a new  item  g , to  the  frequency  distribution 
of  the  total  set  of  0 , as  we  did  in  the  Histogram  Ratio  Method, 

a polynomial  of  degree  3 or  4 is  fitted  by  the  method  of  moments, 
to  the  set  of  6 for  the  hypothetical  examinees  who  answered  item 
g correctly  as  well  as  to  the  set  of  e for  those  who  answered 
item  g liKorrectly.  Then  the  two  polynomials  are  added  together 
to  make  a new  polynomial  for  the  total  group  of  examinees,  and 
the  estimated  item  characteristic  function  is  obtained  as  the  ratio 
of  the  polynomial  for  the  "success"  group  of  examinees  to  the 
polynomial  for  the  total  group.  In  so  doing,  again,  both  poly- 
nomials of  degrees  3 and  4 were  used.  For  convenience,  hereafter, 
we  shall  call  these  four  cases  Degree  3-3  Case,  Degree  3-4  Case, 
Degree  4-3  Case  and  Degree  4-4  Case,  respectively,  with  the  first 
number  indicating  the  degree  of  the  polynomial  used  for  g(e)  , 
whereas  the  second  number  indicates  the  degree  of  the  polynomial 
used  in  graduating  the  set  of  6 for  each  of  the  succese  and 
failure  groups. 

The  simple  unweighted  least  square  method  is  used  to  evaluate 
the  result,  by  obtaining  the  estimates  of  the  two  parameters  In 
the  normal  ogive  model,  as  was  done  previously  (Samejime,  1977b). 
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The  model  specifies  the  Item  characteristic  function,  P (e)  , 
for  item  g in  the  form 


(2.10)  P(e) 
8 


-H 

« e ^ du  , 


where  a is  the  discrimination  parameter  and  b is  the 
* g 

difficulty  parameter.  Let  F (e.)  be  the  estimated  value  of  the 

g 3 

item  characteristic  function  for  Che  midpoint  of  Che  J-ch  interval. 


and  define  ( . such  that 


gj 


(2.11)  Pg(e) 


e ^ du  . 


Thus  following  Che  least  square  principle,  we  define  Q by 


m 


(2.U)  20  . - .^<e^  - b^))‘  . 


and  differentiating  Q with  respect  to  a and  b and  setting 

g g 

the  results  equal  to  zero,  we  obtain 
m 


(2.13) 


12;  V“ 


and 

(2.1A) 


12;  ■ ''gj  - - ‘g”  *g  ■ ^ 


The  above  equations  lead  us  to  the  estimates  of  a and  b such 
chat 


(2.15)  a^  - Cov.  (5^,  ej)(Var.  (e^)) 


-1 


and 
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(2.16)  7 - (Cov.  (c  0,))‘W.  (0JT  , 

8 gj  J J g 


where  0 and  5 are  the  means  of  0,  and  C . respectively, 

g j gj 


These  estimates,  a and  b , are  to  be  compared  with  their 

g g 


respective  parameters,  a and  b 

g g 


Throughout  the  present  study,  the  Interval  width  used  for 
the  parameter  estimation  la  0,2  . 
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III  Results 

Figures  3-1  and  3-2  present  the  histograms  of  the  frequency 
distribution  of  9 for  both  the  success  and  failure  groups  of  examinees 
for  each  of  the  ten  binary  items,  and  the  approximated  polynomials  of 
degree  3 and  4 

(3.1)  a + Se  + + 59^ 

and 

(3.2)  o + se  + yS^  + 69^  + ve^  , 

which  were  obtained  by  the  method  of  moments  described  in  the 
preceding  section. 

As  Is  expected,  the  polynomials  of  degree  4 show  a slightly 
better  fit  to  the  histograms  than  the  polynomials  of  degree  3,  but 
overall  the  fit  Is  fairly  good  for  both  types  of  curves.  Another 
conspicuous  feature  of  these  curves  Is  that  outside  the  ranges  of  9 
where  the  histograms  are  located  there  exist,  for  many  items,  rapid 
Increases  and  decreases.  This  warns  us  that  we  should  be  careful  to 
avoid  using  these  meaningless  parts  of  the  polynomials  In  the 
estimation  of  Item  characteristic  functions.  (Note  that  In  these 
graphs  the  two  histograms  for  the  success  and  failure  groups  are  drawn 
In  such  a way  that  the  sum  total  of  the  two  histograms  makes  the 
relative  frequency  distribution  of  § . In  other  vrords,  these  histograms 
are  proportioned  according  to  the  relative  frequencies  of  those  who 
answered  correctly  to  the  Item  and  those  who  did  not,  to  make  the  sum 
of  these  areas  unity.) 


Item 
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FIGURE  3-1  (Continued) 
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FIGURE  3-1  (Continued) 
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(Continued) 
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FIGURE  3-2  (Continued) 


(Continued) 
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Tables  3-1  and  3-2  show  the  coefficients  of  the  polynomials  of 
degrees  3 and  A in  (3.1)  and  (3.2),  for  both  the  success  and  failure 
groups  of  each  of  the  ten  binary  Items.  For  additional  Information, 
these  polynomials  were  converted  to  those  for  the  probability  density 
function  of  each  group,  and  the  converted  coefficients  are  presented  In 
Appendix  I,  as  Table  A-1-1. 


Estimated  Coefficients  of  the  Polynomial  Graduating  the  set  of  6 of  Each  of  the  Failure  and 

Success  Croups  of  Examinees:  Degree  3-3  Case 


0.2070<)  -0.03044"  -0.02141  0.00310  — 0r0l73T~-^.crD5^T"™0T0561^---O-.orD2  7- 


TABLE  3-1  (Continued) 


19036  -0.03170  -0.01255  0. 00360  ' -0. 00 106 ' t D.OO‘^90  ■"  0 .03 1 6 1 ' -0. 006 66  0.01617  -0.005‘>6 
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The  esclmated  item  characteristic  functions  of  the  ten  binary  items 
in  Degree  3 Case  are  presented  as  Figure  3-3,  and  those  in  Degree  4 Case 
are  as  Flgvire  3-4.  In  each  case,  the  approximated  polynomials  of  degree  3 
are  drawn  by  broken  curves,  and  those  of  degree  4 are  drawn  by  dotted 
curves . 

As  Is  expected,  these  curves  pass  through  the  plots  obtained  by 
the  Histogram  Ratio  Method  of  9 In  both  Degree  3 Case  (triangles)  and 
Degree  4 Case  (squares) . We  notice  that  there  are  crooked  parts  at  the 
ends  of  the  curves  for  some  Items,  and  this  suggests  the  effect  of  the 
meaningless  parts  of  the  approximated  polynomials,  which  are  shown  In 
Figures  3-1  and  3-2. 

For  additional  Information,  a similar  procedure  has  been  applied 

for  the  set  of  500  true  B and  also  for  the  set  of  500  maximum 

likelihood  estimates  6 . The  resulting  curves  are  presented  In  Appendix  I 

as  Figures  A- 1-1  and  A-1-2.  Note  In  these  cases  the  results  of  using 

polynomials  of  degree  5 are  added  to  those  of  degrees  3 and  4. 

Tables  3-3  and  3-4  present  the  estimates  of  the  two  parameters  of 

the  normal  ogive  model,  l.e.,  the  discrimination  parameter  a^  and  the 

difficulty  parameter  b , which  were  obtained  by  the  simple  least  square 

8 

method  described  In  the  preceding  section.  In  order  to  avoid  the  effect  of 

the  meaningless  parts  of  the  approximated  polynomials,  a rather  narrow  range 

of  9 , -2.0  through  2.0  , Is  taken  for  the  estimation,  with  the 

« 

subinterval  step  of  0.2  , and  within  this  Interval  the  values  of  the  < 
eatimated  item  characteristic  function  between  0.05  and  0.95  Inclusive 
were  used.  We  can  see  that  the  results  of  the  present  method  are  as  good 
as  those  obtained  from  the  true  9 In  both  parameters.  Since  the  strict 


-36- 


restrlctlon  on  the  range  of  9 has  resulted  in  reducing  the  number  of 

points  used  for  the  estimation,  and  indeed  for  item  1 on  the  true  9 using 

the  polynomials  of  degree  4 there  is  only  one  point  and  the  estimates  are 

not  obtainable,  similar  parameter  estimates  using  the  range  of  the  estimated 

item  characteristic  function  0.01  through  0.99  inclusive.  Instead 

of  0.05  through  0.95  , were  obtained,  and  presented  as  Tables  3-5  and 

3-6.  In  these  tables,  we  can  see  that  the  results  of  the  present  method 

are  even  better,  compared  with  those  for  the  true  9 , especially  in  the 

estimation  of  a^  . For  an  additional  information,  the  corresponding 

results  obtained  by  using  the  interval  of  9 between  -3.0  and  3.0 

inclusive  are  presented  as  Tables  3-7  and  3-3.  We  notice  that  the 

estimates  of  the  present  method  are  much  worse,  especially  for  the 

discrimination  parameter  a , which  indicate  the  effect  of  the  crooked 

S 

parts  of  the  estimated  item  characteristic  functions. 

The  comparison  of  the  results  of  the  present  method  with  the 

corresponding  results  by  the  Histogram  Ratio  Method  of  the  Two-Parameter 

Beta  Method  for  § Indicates  that,  as  a whole,  there  is  some  improvement 

in  the  estimation  of  the  discrimination  parameter  a^  , but  no  visible 

improvement  in  the  estimation  of  the  difficulty  parameter  b (cf.  Samejima, 

S 

1977d,  Tables  7-1,  7-2,  A- 3-1  and  A- 3-2). 


There  are  some  mistakes  in  the  plotting  of  the  triangles,  which  represent 
the  results  of  Degree  3 Case  by  the  Histogram  Ratio  Method,  in  Figure  7-9 
of  the  Research  Report  77-1.  Please  refer  to  Figure  3-3  of  this  research 
report  for  corrections. 
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Estimated  Item  Characteristic  Functions  by  the  Curve  Fitting  Method  for  0 , in 
Degree  3-3  Case  (Broken  Curve)  and  Degree  3-4  Case  (Dashed  Curve),  with  Those  Ob 
tained  by  the  Histogram  Ratio  Method  for  0 in  Degree  3 Case  (Triangles)  and 
Degree  4 Case  (Squares)  and  the  True  Item  Characteristic  Function  (Solid  Curve) 
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FIGURE  3-3  (Continued) 
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Estimated  Item  Characteristic  Functions  by  the  Curve  Fitting  Method  for  9 , In 
Degree  4-3  Case  (Broken  Curve)  and  Degree  4-4  Case  (Dashed  Curve),  with  Those  Ob- 
tained by  the  Histogram  Ratio  Method  for  6 in  Degree  3 Case  (Triangles)  and 
Degree  4 Case  (Squares)  and  the  True  Item  Characteristic  Function  (Solid  Curve) 
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FIGURE  3-4  (Continued) 
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FIGURE  3-4  (Continued) 


TABLE  3-3 

Discrimination  Parameter  and  Its  Estimates  of  the  Ten  Binary  Items  Obtained  by  the  Curve-Fitting 
Method  for  6 , l.e..  Degree  3-3,  3-4,  4-3  and  4-4  Cases,  for  6 (3  Cases)  and  for  the  Maximum 

Likelihood  Estimates  § (3  Cases);  The  Range  of  the  Frequency  Ratios  Used  Is  [0.05,  0.95],  with- 

in the  Interval  of  0 , [-2.0,  2.0]. 


The  number  of  Intervals  used  In  estimation  Is  shown  as  a subscript  when  It  Is  less  than 


TABLE  3-4 

Difficulty  Parameter  and  Its  Estimates  of  the  Ten  Binary  Items  Obtained  by  the  Curve-Fitting  Method 
for  6 , l.e..  Degree  3-3,  3-4,  4-3  and  4-4  Cases,  for  6 (3  Cases)  and  for  the  Maximum  Likelihood 

Estimates  9 (3  Cases);  The  Range  of  the  Frequency  Ratios  Used  Is  [O.OS,  0.95],  within  the  Interval 


The  number  of  Intervals  used  In  estimation  Is  shown  as  a subscript  when  It  Is  less  than 


TABLE  3-5 

Discrimination  Parameter  and  Its  Estimates  of  the  Ten  Binary  Items  Obtained  by  the  Curve-Fitting 
Method  for  0 , l.e..  Degree  3-3,  3-A,  4-3  and  4-4  Cases,  for  6 (3  Cases)  and  for  the  Maximum 

Likelihood  Estimates  § (3  Cases);  The  Range  of  the  Frequency  Ratios  Used  Is  [0.01,  0.99),  with' 

In  the  Interval  of  6 , [-2.0,  2.0] 
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number  of  Intervals  used  In  estimation  Is  shown  as  a subscript  when  It  Is  less  than 


Difficulty  Parameter  and  Its  Estimates  of  the  Ten  Binary  Items  Obtained  by  the  Curve-Fitting  Method 
for  9 , i'S’*  Degree  3-3,  3-4,  4-3  and  4-4  Cases,  for  6 (3  Cases)  and  for  the  Maximum  Likelihood 

Estimates  6 (3  Cases);  The  Range  of  the  Frequency  Ratios  Used  Is  [0.01,  0.99],  within  the  Interval 
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The  number  of  intervals  used  in  estimation  Is  shown  as  a subscript  when  it  is  less  than 


TABLE  3-7 

Discrimination  Parameter  and  Its  Estimates  of  the  Ten  Binary  Items  Obtained  by  the  Curve-Fitting 
Method  tor  6 , l.e.,  Degree  3-3,  3-4,  4-3  and  4-4  Cases,  for  0 (3  Cases)  and  for  the  Maximum 

Likelihood  Estimates  6 (3  Cases):  The  Ranee  of  the  Freauencv  Ratios  Used  Is  fO.05.  0.951.  with' 


number  of  intervals  used  in  estimation  is  shown  as  a subscript  when  it  is  less  than 
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IV  Conditional  P.D.F.  Method 

. One  of  the  reasons  why  0 was  calibrated  by  the  Monte  Carlo 

method  to  multiply  the  data  In  the  preceding  study  (Samejima,  1977d) 
is  to  compare  the  results  with  those  obtained  by  the  Normal  Approxim- 
ation Method.  Another  reason  is  to  compare  them  directly  with  the 
results  obtained  from  the  set  of  five  hundred  true  ability  values 
themselves,  such  as  the  frequency  ratio  of  ^he  examinees  who  answered 
item  g correctly  to  the  total  group  of  examinees  for  each  of  the 
twenty  subintervals  of  9 , etc.,  which  are  not  observable  in  the 
empirical  situation.  In  fact,  the  frequency  ratios  obtained  by  the 
Two-Parameter  Beta  Method  turned  out  to  give  closer  estimates  of 
the  true  item  characteristic  functions  than  those  obtained  directly 
from  the  set  of  five  himdred  true  ability  scores,  and  the  estimated 
parameter  values  in  the  normal  ogive  model  obtained  by  the  IVo- 
Parameter  Beta  Method  are  as  good  as  those  obtained  directly  from  the 
ability  scores  (cf.  Samejima,  1977d,  Figures  7-9  and  8-3,  Tables  7-1, 
7-2,  8-1  and  8-2). 

It  is  a logical  speculation,. . therefore,  that  the  use  of  the 
total  conditional  probability  density  function  of  6 , given  6 , 
instead  of  the  calibrated  8's  , may  still  Increase  the  accuracy 
of  the  estimation  of  the  operating  characteristics.  For  this  reason, 
the  second  half  of  the  present  study  is  focused  upon  investigating  this 
possibility,  as  well  as  finding  out  its  limitations. 

We  follow  the  same  procedure  up  to  the  fourth  step  described 
in  Section  II,  and  obtain  $(6(§^)  , the  conditional  density  function 
of  0 , given  § , which  is  estimated  by  the  Two-Parameter  Beta  Method. 

1 
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The  estimated  item  characteristic  function,  given  by 

N 

(4.1)  P-O)  - z $(e|9  )t  z $(e|§  )]'^ 

* sGG  ® S-1  ® 

where  s is  the  individual  examinee,  G is  the  group  of  examinees 
who  answered  item  g correctly,  N (•  500)  is  the  number  of  examinees 
in  the  total  group,  and  4)(6|6)  is  the  approximated  conditional 
density  function  given  by  (2.5)  . The  a priori  set  parameters,  ag 
and  bg  , for  the  Beta  distribution  are  the  same  as  those  used  in 
the  previous  study  (Samejlma,  1977d),  i.e.,  6 ± 0.54825  ; 

and  the  other  two  parameters,  pg  and  qg  , are  also  the  same  as 
before,  which  were  obtained  from  the  conditional  expectation  and 
variance  of  6 , given  6 , and  the  estimate  of  g(§)  which  were 
approximated  by  the  polynomial  of  degree  3 or  4,  through  equations 
(2.6)  and  (2.7). 

From  the  approximated  conditional  density  function,  <|)(6|6  ) , 

8 

we  can  obtain  an  estimate  of  the  probability  density  function  of  6 , 
such  that 

(4.2)  f(e)  - ^ <Ke|eg)  . 

s-l 

It  is  expected  that,  if  this  estimated  density  function  is  close  to 
the  true  density  function,  f(d)  , then  the  estimated  operating 
characteristics  obtained  by  the  method  in  question  will  be  reasonably 
close  to  the  true  operating  characteristics. 
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V Criterion  Item  Charac ^ ^ Function 

As  a criterion  for  eva-aatlng  the  resulting  estimates  of  the 
item  characteristic  function,  we  will  use  the  counterpart  of  (4.1) 
by  replacing  $(6|@)  by  the  true  conditional  density  of  6 , given 
§ , which  Is  given  by 

(5.1)  (Ke|§)  - t|;(§|e)  f(e)  [j  4/(§|9)  fO)  d0]‘^  . 

/ — oo 

where  41(6 1 6)  is  approximated  by  the  normal  density  with  8 and 
the  inverse  of  the  test  characteristic  function  1(9)  (*  21.63)  as  its 

two  parameters,  by  virtue  of  the  asymptotic  property  of  the  maximum 
likelihood  estimate  (Samejima,  1975,  1977a,  1977b),  and  f(9)  (-0.2) 

is  the  probability  density  function  of  9 . Hereafter,  this  function 
is  called  the  criterion  item  characteristic  function.  Just  like 
the  frequency  ratios  of  the  true  9 , the  criterion  item  characteristic 
function  is  not  observable  in  the  empirical  situation,  and  only  the 
simulation  study  can  receive  its  benefit. 

Note  that  ti  criterion  item  characteristic  function  does  depend 
upon  the  sampling  flucwuatlons  of  the  examinee's  item  responses  to  the 
binary  item.  In  this  sense,  this  criterion  Itself  will  show  the 
limitation  of  the  methods,  in  which  (4.1)  is  used  for  the  estimated 
item  characteristic  function,  however  refined  4i(0|6)  may  be. 

The  criterion  item  characteristic  function  is  also  beneficial 
in  the  sense  that  it  provides  us  with  a continuous  function,  rather 
than  a discontinuous  set  of  frequency  ratios,  in  the  type  of  research 
like  the  present  one.  It  will  be  fully  used  not  only  in  the  present 
study,  but  in  the  future  research  on  the  estimation  of  the  operating 


characteristics . 
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In  the  following  section,  the  criterion  Item  characteristic 
function  for  each  of  the  ten  binary  Items  Is  drawn  by  a thin,  solid 
line  to  be  compared  with  the  resulting  estimated  Item  characteristic 
functions  by  the  Conditional  P.D.F.  Method  of  the  TWo-Parameter 
Beta  Method.  The  set  of  frequency  ratios  obtained  from  the  five 
hundred  true  ability  scores  Is  also  presented  In  the  same  figures,  us- 
ing a broken  and  dotted  line. 
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VI  Results 

Figures  6-1  and  6-2  present  the  estimated  probability  density 
functions  of  ^ (solid  curve)  obtained  by  (4.2),  for  Degree  3 and  4 
Cases  respectively.  In  the  same  graphs,  the  theoretical  probability 
density  function,  f(8)  , (dashed  line)  and  the  frequency  ratios  of 
6 with  0.25  as  the  width  of  Intervals  (solid  line),  which  were  ob- 
tained previously  (Samejlma,  1977d),  are  also  drawn. 

As  Is  expected,  these  curves  are  very  close  to  the  corresponding 
histograms  of  6 , smoothing  out  Its  sampling  fluctuations  caused  by 
the  calibration  of  6 by  the  Monte  Carlo  method.  The  curves  are  not 
so  close  to  the  theoretical  density  f(9)  , however,  especially  for 
the  extreme  values  of  8 . This  Is  considered,  mainly,  to  be  due  to  the 
two  artificially  set  parameter  values  for  each  Beta  density  function. 

We  can  also  see  that  there  Is  a slight  difference  between  Degree  3 and 
4 Cases,  l.e.,  the  curve  Is  closer  to  f(8)  as  a whole  In  Degree  4 
Case  than  In  Degree  3 Case.  This  may  be  caused  partly  because  of  the 

A 

better  approximation  of  g(8)  in  Degree  4 Case;  at  the  same  time, 
however,  we  should  note  that  In  Degree  4 Case  seven  extreme  values  of 
8 were  excluded,  and  this  must  be  another  reason  for  the  better  fit 
In  Degree  4 Case  at  the  extreme  values  of  8 . . 

Figure  6-3  presents  the  estimated  Item  characteristic  functions 
obtained  by  the  Conditional  P.D.F.  Method  of  the  Two-Parameter  Beta 
Method,  In  Degree  3 Case  (broken  curve)  and  In  Degree  4 Case  (dashed 
curve),  together  with  the  theoretical  Item  characteristic  function 
(thick  solid  curve)  and  the  criterion  Item  characteristic  function 
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(chin  solid  curve),  for  each  of  the  ten  binary  Items.  In  the  same 
graphs,  the  frequency  ratios  of  the  hypothetical  examinees  who  answered 
the  Item  correctly  are  plotted  (broken  line  and  dots),  with  the  Interval 
length  of  0.25  for  the  range  of  6 -2.5  through  2.5  Inclusive, 

giving  each  subinterval  the  total  of  25  examinees. 

A striking  truth  of  these  results  Is  that,  for  a substantial 
range  of  9t  the  resulting  estimated  Item  characteristic  functions  are 
almost  Identical  with  the  corresponding  criterion  Item  characteristic 
functions.  In  fact.  If  we  take  the  range  of  9 , -2.0  through  2.0  , 

the  two  curves  for  the  estimated  Item  characteristic  functions  In  Degree 
3 and  4 Cases  are  almost  indistinguishable  from  the  criterion  curve  for 
every  binary  Item.  This  Is  more  conspicuously  shown  In  such  Items  as 
Item  4 and  item  10  for  which  the  criterion  Item  characteristic  function 
deviates  substantially  from  the  true  Item  characteristic  function,  al- 
though for  other  Items  like  Item  3,  Item  5,  Item  6,  Item  7 and  Item  9, 
for  which  the  deviation  of  the  criterion  item  characteristic  function 
from  the  true  Item  characteristic  function  Is  relatively  small,  the  two 
estlisated  Item  characteristic  functions  In  Degree  3 and  4 Cases  literally 
trace  the  criterion  Item  characteristic  function. 

We  notice  that,  In  spite  of  the  deviations  of  f(9)  obtained  by 
the  TWo-Parameter  Beta  Method  from  the  true  density  function  f(e)  , 
the  estimated  Item  characteristic  functions  In  both  Degree  3 and  4 Cases 
are  almost  as  good  as  the  criterion  Item  characteristic  functions.  This 
fact  makes  us  wonder  how  strongly  the  discrepancies  between  the  estimated 
probability  density  function  of  0 by  the  Conditional  P.D.F.  Method  and 
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the  theoretical  probability  density  fiinction,  which  are  illustrated  in 
Figures  6-1  and  6-2,  are  really  affected  by  the  way  we  approximated  the 
conditional  distribution  by  a Beta  distribution;  or  are  they  more  affected 
by  the  saiig>ling  fluctuations  of  the  item  responses?  To  answer  this  ques- 

A .A 

tion.  Figure  6-4  presents  the  function  given  by  (4.2),  where  ^(e|e  ) 
is  replaced  by  the  true  conditional  density  function  obtained  by  (5.1). 

It  is  clear  from  this  figure  that,  although  at  the  extreme  ends  of  6 
this  curve  shows  a substantial  difference  from  the  two  curves  obtained 
by  the  Conditional  P.D.F.  Method  of  the  Two-Parameter  Beta  Method,  for 
the  intermediate  values  of  6 it  shows  fairly  similar  variations  as  are 
seen  in  the  results  of  the  other  two  curves:  evidence  which  con- 

firms the  above  suspicion. 

The  usual  simple  least  square  method  was  applied  in  estimating 
the  discrimination  and  difficulty  parameters  of  the  normal  ogive  model,  a 

g 

and  . Table  6-1  and  Table  6-2  present  the  results  in  Degree  3 and  4 
Cases  and  for  the  criterion  item  characteristic  functions,  using  the  inter- 
val of  the  ordinate  0.05  and  0.95  Inclusive,  within  the  interval  of  6 
-4.0  and  4.0  Inclusive.  We  notice  that  for  some  items,  lllce  item  2,  the 
values  of  the  estimated  parameters  are  substantially  different  from 
the  true  parameters,  and  also  from  those  for  the  criterion  item  cha- 
racteristic functions,  whereas  for  some  other  items,  like  item  8,  they 
are  close. 

These  large  differences  are  due  to  the  deviations  of  the  esti- 
mated item  characteristic  functions  at  the  extreme  values  of  e , as 
is  hinted  in  Figure  6-3  . Tables  6—3  and  6-4  present  similar  estimates 


FIGURE  6-4 

Approxlaated  Marginal  Density  Function  of  0 Obtained  from  the  500  Conditional  Density 
Functions  of  0 , Given  Its  Maximum  Likelihood  Estimate  (Solid  Curve)  and  the  Theoretical 

Density  Function  of  0 (Dashed  Line) 


i 


-83- 

obtalned  by  using  the  Interval  of  0 , -2.4  and  2.4  lotlusive.  In- 

stead of  [-4.0,  4.0]  . Ve  note  that  the  values  of  the  estimated  para- 
meters for  the  criterion  item  characteristic  functions  are  identical  with 
those  in  Tables  6—1  and  6—2  • This  is  from  the  fact  that  the  functions 
assume  xero  and  unity  outside  the  range  of  9 -2.5  through  2.5  res- 
pectively. The  estimates  of  a and  b in  Degree  3 and  4 Cases  shown 

8 8 

in  these  two  tables  are,  as  a whole,  much  closer  to  the  true  parameter 
values,  and  quite  comparable  with  those  obtained  from  the  criterion 
item  characteristic  functions.  For  additional  information,  the  com- 
plete  lists  of  the  estimated  parameters,  which  Include  the  cases  where 
the  interval  of  the  ordinate  is  0.15  through  0.85  , 0.10  through 

0.90  , and  0.01  through  0.99  , are  presented  in  Appendix  II,  as 
Tables  A-2-1  and  A-2-2  when  the  range  of  9 is  -4.0  through  4.0  , 
and  as  Tables  A-2-3  and  A-2-4  when  the  range  of  9 is  -2,4  through 
2.4  . 


I 


< 

i 1 


TABLE  6-1 

Discrimination  Parameter  and  Its  Estimates  of  Each  of  the  Ten 
Binary  Items  Obtained  from  the  Estimated  Item  Characteristic 
Functions  by  the  Conditional  P.D.F.  Method  of  the  Two-Parame- 
ter Beta  Method,  with  the  Corresponding  Estimate  Obtained  from 
the  Criterion  Item  Characteristic  Function:  The  Range  of  Each 
Estimated  Item  Characteristic  Function  Used  Is  [0.05,  0.95], 
within  the -Interval  of  0 , [-4.0,  4.0] 


METHOD 


TRUE  A from  & from  A from 

Egg 
a DGR. 3 DGR. 4 CRITERION 

g 


ITEM 

g 

1 

0.05- 

0.95 

0.05- 

0.95 

0.05- 

0.95 

1 

1.5 

1.323 

1.289 

1.400 

1 

5 

2 

1.0 

0.349 

0.608 

1.024 

3 

2.5 

1.314 

1.222 

1.788 

4 

1.0 

0.787 

0.812 

0.868 

5 

1.5 

1.394 

1.378 

1.368 

6 

1.0 

0.906 

0.602 

0.895 

7 

2.0 

1.489 

1.480 

1.473 

8 

1.0 

0.911 

0.919 

0.886 

® i 

2.0 

1.676 

1.709 

1.716 

10 

1.0 

0.659 

0.724 

0.725 

The  n\imber  of  Intervals  used  in  estimation  Is  shown  as  a subscript 
when  It  Is  less  than  6. 
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TABLE  6-2 

Difficulty  Parameter  and  Its  Estimates  of  Each  of  the  Ten  Bin- 
ary Items  Obtained  from  the  Estimated  Item  Characteristic  Fun- 
ctions by  the  Conditional  P.D.F.  Method  of  the  Two-Parameter 
Beta  Method,  with  the  Corresponding  Estimate  Obtained  from  the 
Criterion  Item  Characteristic  Function:  The  Range  of  Each  Es- 
timated Item  Characteristic  Function  Used  Is  [0.05,  0.95], 


within  the 

Interval  of 

8 . [-4.0, 

4.0]. 

METHOD 

TRUE 

b 

8 

6 from 

8 

DGR.3 

S from 

8 

0GR.4 

& from 

8 

CRITERION 

ITEM 

0.05- 

0.95 

0.05- 

0.95 

0.05- 

0.95 

1 

-2.5 

-2.684 

-2.737 

-2.651 

5 

2 

-2.0 

-3.310 

-2.478 

-2.002 

3 

-1.5 

-1.527 

-1.552 

-1.507 

4 

-1.0 

-1.071 

-1.056 

-1.005 

5 

-0.5 

-0.462 

-0.466 

-0.472 

6 

0.0 

-0.074 

0.047 

-0.075 

7 

0.5 

0.523 

0.528 

0.527 

8 

1.0 

0.958 

0.953 

0.981 

9 

1.5 

1.505 

1.506 

1.502 

10 

2.0 

2.259 

2.170 

2.118 

The  number  of  intervals  used  in  estimation  is  shown  as  a subscript 
when  it  is  less  than  6. 
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TABLE  6-3 

Discrimination  Parameter  and  Its  Estimates  of  Each  of  the  Ten 
Binary  Items  Obtained  from  the  Estimated  Item  Characteristic 
Fiinctlons  by  the  Conditional  P.D.F.  Method  of  the  Two-Parame- 
ter Beta  Method,  with  the  Corresponding  Estimate  Obtained  from 
the  Criterion  Item  Characteristic  Function:  The  Range  of  Each 
Estimated  Item  Characteristic  Function  Used  Is  [O.OS,  0.95], 


within 

the  Interval  of 

e , [-2.4, 

2.4]. 

METHOD 

TRUE 

A from 

& from 

A from 

g 

g 

g 

a 

g 

DGR.3 

DGR.4 

CRITERION 

ITEM 

0.05- 

0.05- 

0.05- 

0.95 

0.95 

0.95 

1 

1.5 

1.119 

1.112 

1.400 

5 

5 

S 

2 

1.0 

1.051 

1.067 

1.024 

3 

2.5 

1.785 

1.777 

1.788 

4 

1.0 

0.887 

0.875 

0.868 

5 

1.5 

1.394 

1.378 

1.368 

6 

1.0 

0.906 

0.897 

0.895 

7 

2.0 

1.489 

1.480 

1.473 

8 

1.0 

0.911 

0.919 

0.886 

9 

2.0 

1.676 

1.709 

1.716 

10 

1 

1.0 

0.762 

0.769 

0.725 

The  number  of  Intervals  used  In  estimation  Is  shown  as  a subscript 
when  it  is  less  than  6. 


TABLE  6-4 


Difficulty  Parameter  and  Its  Estimates  of  Each  of  the  Ten  Bin- 
ary' Items  Obtained  from  the  Estimated  Item  Characteristic  Fun- 
ctions by  the  Conditional  P.D.F.  Method  of  the  Two-Parameter 
Beta  Method,  with  the  Corresponding  Estimate  Obtained  from  the 
Criterion  Item  Characteristic  Function:  The  Range  of  Each  Es- 
timated Item  Characteristic  Function  Used  Is  [0.05,  0.9^5], 


• 

within  the 

Interval  of 

e , [-2.4, 

2.4]. 

METHOD 

TRUE 

b from 

& from 

S from 

g 

g 

g 

b 

g 

DGR.3 

DGR.4 

CRITERICW 

ITEM 

0.05- 

0.05- 

0.05- 

0.95 

0.95 

0.95 

1 

-2.5 

-2.815 

-2.837 

-2.651 

5 

5 

5 

2 1 

-2.0 

-1.968 

-1.965 

-2.002 

3 

-1.5 

-1.494 

-1.505 

-1.507 

4 

’-1.0 

-0.992 

-1.001 

-1.005 

5 

-0.5 

-0.462 

-0.466 

-0.472 

6 1 

0.0 

-0.074 

-0.075 

-0.075 

7 

0.5 

0.523 

0.528 

0.527 

8 

1.0 

0.958 

0.953 

0.981 

9 

1.5 

1.505 

1.506 

1.502 

10 

2.0 

2.108 

2.104 

2.118 

The  number  of  intervals  used  in  estimation  is  shown  as  a subscript 
when  it  is  less  than  6. 
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VII  Dlsctiaslon  and  Conclua  ion 

Two  variations  of  the  Two-Parameter  Beta  Method  for  estimating 
the  operating  characteristics  of  the  item  score  categories  have  been 
presented,  and  vaed  for  the  same  simulated  data  of  five  hundred 
hypothetical  examinees,  whose  ability  distributes  uniformly.  In 
estimating  the  item  characteristic  functions  of  the  ten  binary  Items 
which  follow  the  normal  ogive  model.  The  criterion  operating 
characteristic  has  been  Introduced,  and  used  for  evaluating  the 
results  of  the  Conditional  P.D.F.  Method. 

In  spite  of  the  fact  that  the  approximation  by  a polynomial  of  deg' 
ree  3 has  a theoretical  disadvantage  compared  with  the  approximation  by 

a polynomial  of  degree  4,  so  far  we  have  found  very  little  evidence 
to  support  Degree  4 Case  in  preference  to  Degree  3 Case.  Any  hasty 
conclusions  should  be  avoided,  however,  until  we  have  tried  these 
methods  on  different  types  of  data,  e.g.,  those  of  a smaller  or 
larger  number  of  examinees,  those  on  different  ability  distributions, 
those  for  estimating  the  operating  characteristics  other  than  those 
in  the  normal  ogive  model  of  the  dichotomous  response  level,  and  so 
forth. 

The  results  of  the  Conditional  P.D.F.  Method  suggest  that  the 
a priori  set  functional  formula  for  the  conditional  distribution  of 
ability,  given  its  maximum  likelihood  estimate,  may  not  be  so  Important 
as  it  looks,  and  some  other  functional  formulas  might  serve  as  well. 
Investigations  in  this  direction  will  be  made  in  the  near  future. 

The  fact  that  in  both  Degree  3 and  4 Cases  the  estimated  item 
characteristic  function  turned  out  to  be  extremely  close  to  the 
criterion  item  characteristic  function  indicates  the  limitation  of 
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che  Conditional  P.D.F.  Method,  as  well  as  Its  accuracy  of  estimating 
the  operating  characteristics.  For  this  reason,  not  only  we  should 
try  to  refine  the  present  method,  but  also  we  should  explore  new 
methods  which  will  eliminate  the  strong  effect  of  the  sampling 
fltx:tuatlons  of  the  Item  responses.  This  can  be  done  by  approximating 
the  bivariate  distribution  of  ability  and  Its  maximum  likelihood 
estimate  for  each  Item  score  group,  the  Idea  which  underlies  the 
Normal  Approximation  Method.  Investigations  will  be  made  In  this 
direction  also  In  the  near  future. 


71!  71! 
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FIGURE  A-1-2  (Continued) 
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